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ABSTRACT 

We present measurements of the excess mass-to-light ratio measured around MaxBCG galaxy clus- 
ters observed in the SDSS. This red sequence cluster sample includes objects from small groups with 
M200 ~ 5x 10 12 /i _1 Mq to clusters with M200 ~ 10 15 h^ 1 M Q . Using cross-correlation weak lensing, we 
measure the excess mass density profile above the universal mean Ap(r) = p(r) — p for clusters in bins 
of richness and optical luminosity. We also measure the excess luminosity density A£(r) = £(r) — £ 
measured in the z = 0.25 z-band. For both mass and light, we de-project the profiles to produce 
3D mass and light profiles over scales from 25/i _1 kpc to 22h~ x Mpc. From these profiles we cal- 
culate the cumulative excess mass AM(r) and excess light AL(r) as a function of separation from 
the BCG. On small scales, where p(r) ^> p, the integrated mass-to-light profile (AM / AL)(r) may 
be interpreted as the cluster mass-to-light ratio. We find the (AM/AL)2oo> the mass-to-light ratio 
within ^200, scales with cluster mass as a power law with index 0.33 ± 0.02. On large scales, where 
p(r) ~ p, the AM/ AL approaches an asymptotic value independent of cluster richness. For small 
groups, the mean (AM/AL)2oo is much smaller than the asymptotic value, while for large clusters 
(AA//AL)2oo is consistent with the asymptotic value. This asymptotic value should be proportional 
to the mean mass-to-light ratio of the universe {M/L). We find (M/L) bj£, L = 362 ± 54ft, (statistical). 
There is additional uncertainty in the overall calibration at the ~10% level. The parameter b 2 M ^ L is 
primarily a function of the bias of the L < L* galaxies used as light tracers, and should be of order 



unity. Multiplying by the luminosity density in the same bandpass we find Qmb M / L - 
independent of the Hubble parameter. 

Subject headings: dark matter — galaxies: clusters: general — gravitational lensing 
structure of the universe 



0.20 ±0.03, 
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1. INTRODUCTION 

Comparison of luminous to total mass thr ough the 
mass- to- light ratio (M/L) dates back at least to lKapte"vnl 
(1922), who found values of M/L ~ 2 M /L o in the solar 
neighborhood. Similar values we re found by H ubble for 
the disk of Andromeda in 1929 (jHubbld fl929h . Exten- 
sion of this approach to larger scales was fi rst attempted 
by Zwicky in 1933 for the Coma cluster (|Zwickvlll933L 
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Il937f h Under reasonable assumptions about the cluster's 
dynamical state, Zwicky estimated the mass of the clus- 
ter. The inferred average M/L for cluster galaxies was 
around 500 M q /Lq, implying a surprising dominance of 
dark over luminous matter. 

Decades of observation confirm Zwicky's conclusion 
that dark matter dominates on large scales. The simplest 
model capable of accommodating current observations is 
the Cold Dark Matter (CDM) model. It is assumed that 
approximately 80% of the cosmic mass density is in the 
form of some dynamically cold, collisionless species of 
subatomic particle. Monte Carlo predictions for the evo- 
lution of structure in such a CDM universe have reached 
a high level of sophistication. Given a set of cosmolog- 
ical parameters and initial conditions, modern simula- 
tions are accurate and predictive within well understood 
limitations imposed by resolution and simulation volume 
(see e.g. the comparison paper of iHeitmann et al.ll2007l . 
and references therein). However, accurate simulations of 
the baryonic components of clusters remain beyond both 
current theoretical understanding and simulation tech- 
nology. In the absence of direct theoretical prediction, 
observations of the connection between observable light 
and computable dark matter remain important. The 
simplest connection, still relevant 90 years after Kapteyn, 
is the mass-to-light ratio. 

Observations of both mass and light have advanced 
considerably since Zwicky's time. Dynamics of cluster 
galaxies still provide important constraints on cluster 
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masses. These have confirmed Zwicky's essential con- 
clusions with improv ed accuracy over a broader rang e 
of environments (e.g. ICarlberg. Yee. fc Elhngson]|1997f ). 
There are also new analysis techn iques such as the "caus- 
tics" method (Rincs ct al. 2004) and velocity stacking 
methods (e.g lBecker et al.l[20 07i) which confirm and ex- 
tend the standard analyses. 

The X-ray emission from the hot intra-cluster plasma 
has become a favored proxy for cluster mass. Under 
the assumption of hydrostatic equilibrium, masses and 
mass-to-light ratios can be calculated from X-ray tem- 
peratures and, less directly, luminosities. These X-ray 
analyses confirm the dy namical measurements of hi gh 
mass-to-light ratios (e.g. lLin, Mohr. fc Stanford! |2004[ ). 

Weak gravitatio nal tensing, a n effect which Zwicky 
thought promising (|Zwickvl [l937'l. is now an established 
technique for studying the mass distributions in clusters 
of galaxies. The effect, deflection of light from back- 
ground objects as it passes foreground clusters, is gener- 
ally less precise on a cluster-by-cluster basis than X-ray 
or velocity measurements. But it is complementary in 
that it is independent of the dynamical state of the sys- 
tem, is linear in the density, and can be measured at a 
large range of separations from the cluster center. 

Weak lensing studies have traditionally focused on 
measuring the mass of individual clusters 12 . How- 
ever, averaging the lensing signal over many clusters 
is more rob ust and easier to interpret (see the dis- 
cussions in I Johnston et all [2007at ISheldon et~aT1 120071 : 
I Johnston et~afll2007bf ). With this method, the full M/L 
profile can be measured from small scales to well beyond 
the bound regions of the clusters . Early stacked M/L 
results for group -sized objects (|Hoekstra et al.l 1200 ft 
iParker et al.ll2005l e.g.) s howed the promise of this ap- 



proach. The pilot study of ISheldon et al.l (|2001l ) demon- 
strated the gr eat potential of the Sloan Digital Sky 
burvey (SDSS; lYork et al.l 12001 for measuring cluster 
masses. 

In this work we measure the mean mass-to-light ratios 
for SDSS grou ps and clusters drawn from the MaxBCG 
cluster sample (jKoester et al.ll2007bl lal) . We compare the 
ensemble mass estimates from the first two papers in 
this series (|Sheldon et all l2007t Uohnston etafl l2007bf l 
with ensemble measurements of the total light. We mea- 
sure the mean mass using lensing, and the mean lumi- 
nosity by correlating the clusters with the surrounding 
galaxies. Each measurement spans a range of separations 
from 25/i _1 kpc to 22/i~ 1 Mpc, extending the volume over 
which the M/L is measured well beyond the virial radii 
of the clusters. 

Because lensing is not sensitive to uniform mass distri- 
butions, aka "mass sheets" , we measure the mean mass 
density of a lens sample above the mean density of the 
universe Ap=p(r) — p (actually we measure a 2D pro- 
jection of the 3D Ap which we will de-project). In 
other words, we measure the cluster- mass cross corre- 
lation function times the mean density of the universe 
Ap = p^cm- On small scales this is a measure of the mean 
density profile, but on large scales, where the density ap- 



12 A few exam pies: iFahlman et al l (119941); ITvson fc Fischeil 
ILuppino fc Kaiseii 1119971) : I Fischer fc TvsorJ (|1997|) 
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proaches the background, it can only be interpreted in 
terms of the correlation function. 

We measure the light using a stacking technique di- 
rectly comparable to the lensing measurements. We in- 
clude light from all galaxies surrounding clusters with 
luminosity above a threshold, and then subtract the mea- 
surements around random points in order to remove the 
uniform background. This means we measure the lumi- 
nosity density above the mean A£ = £(r) — £, so it is 
a correlation function just like the mass measurement . 
The excess mass within radius r divided by the excess 
luminosity within radius r is the ratio of integrals of cor- 
relation functions: 

AM _ Jodr[p(r) - p] r 2 _ _g dr p j cm r 2 

A£ [ ^ f Q dr [ £(r) -I] r 2 f Q dr I U r 2 ' 

On the scale of virialized halos, defined where the mean 
density is a few hundred times the mean, equation [T] can 
be interpreted as the mean cluster M/L in a straightfor- 
ward way. On very large scales, as the density approaches 
the mean, equation [T] becomes proportional to the mean 
mass-to-light ratio of the universe. The proportionality 
constant is related to the particulars of the correlation 
function measurements. For example, the light and mass 
may be clustered differently , and this difference may de- 
pend on the properties of the galaxies chosen as tracers 
of the light. Thus we expect this measurement to de- 
pend on the "bias" of the light tracers relative to the 
mass. This bias depends primarily on the mass of the 
halos hosting these tracers. The bias also depends on 
the variance of the mass density field (a$) as the larger 
the v ariance in th at field the smaller the bias at fixed 
mass (|Kaisejll98l ). 

Full theoretical models of this measurement on all 
scales will require substantial effort. For this paper we 
will parametrize our ignorance at large scales in terms of 
the bias. We will write the asymptotic M/L as 



(AM/ AL) asym = (M/L) b- 2 ^ 



b~ 2 - 

U M/L 



b c£ b 2 n 



(2) 



Hoekstra etaU (119981); IJoffre et al.l (2000); Clowe ct al. (2000) 
20021) ; IWittman et al.1 (120031) ; IXJmetsu et"ail (120051 ) 
(2006). 
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Here (M/L) is the mean mass-to-light ratio of the uni- 
verse, but it is multiplied by a bias factor b^ L . This 
factor depends on the bias of the galaxy tracers rela- 
tive to the mass &£ m , which should be of order unity 
for tracers near L* since these galaxies are expected 
to be only slightly an ti-biased (jSheth fc Tormenl 119991 : 
iSeljak fc Warren] l2004f h This term also depends on the 
ratio of the bias of clusters relative to mass and light 
bcm/bci, which should also be near unity for the same rea- 
sons. The cluster bias is likely to cancel from this equa- 
tion because the cluster terms appear in a ratio b cm /b C £. 

On large enough scales, AM/AL should approach a 
constant value independent of the halo mass as long as 
the bias of the light tracers, and thus b 2 M , L , is equivalent 
in all cases. This appears to be the case, as we will 
demonstrate. 

We assume a Friedman-Robertson- Walker cosmology 
with Q M = 0.27, fl A = 0.73, and H = 100 h km/s/Mpc 
and distances are measured in physical, or proper units, 
rather than comoving units. 
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2. METHODS 

2.1. Lensing Methods 

We will briefly describe the len sing measurem e nts as 
thes e were described in de tail in I Sheldon et al.1 (|2007l ) 
and I Johnston et all (|2007bD . We measure the tangential 
shear induced in background galaxies by a set of fore- 
ground clusters and convert that shear to a redshift in- 
dependent density contrast AS: 

yr{R) x Ecrit = E(< R) - E(i?) = AE , (3) 
where S is the projected surface mass density at radius R 
and E(< R) is the mean projected density within radius 
R. In using this equation, we assume the shear is weak. 
This is not always the case for the lar gest clusters on 
the sm allest scales, as was discussed in I Johnston et all 
(|2007bh . 

We average AE for an ensemble of clusters in radial 
bins from 25ft. _1 kpc to 30ft-~ 1 Mpc, using the brightest 
cluster galaxy (BCG) as the center for all measurements 
(see H3.ll for a description of the cluster selection) . The 
AE is linear in the density and so averaging AE averages 
the density directly (again assuming weak shear). 

A number of correct ions were made to t he profile, as 
discussed in detail in (|Sheldon et al.|[2~007t ). Using ran- 
dom points, corrections were applied for contamination 
of the background source sample with cluster members 
that are not sheared; this correction is large but well- 
determined on small scales, and negligible on large scales. 
We also corrected for residual additive biases in the shear. 
These result from imperfect correction for biases in the 
galaxy shapes caused by PSF anisotropy. We use ran- 
dom points for this correction as well, as the additive 
biases will appear in random points as well as around 
clusters. This correction is negligible on small scales but 
significant on large scales. 

There are remaining uncertainties in the overall cal- 
ibration of the mean AE. These come primarily from 
uncertainties in photometric redshift determinations for 
the background source galaxies and correction for blur- 
ring of the galaxy shapes by the PSF. Simulations of the 
PSF correction suggest it is good to a few pe rcent under 
simplified circumstances (|Massev et al.l 120 06). The pho- 
tometric redshift calibrations are probably less well con- 
strained. Comparisons with studies using Luminous Red 
Galaxies as sources, for which the redshift is better deter- 
mined, suggest the calibr ations are good to about 10% 
( Mandclb aum et al J 12005( 1 . although that study used a 
different photoz algorithm that this work. Simulations of 
the algorithm used in this study al so suggest ^10% errors 
given the size of the training set ( Lima eFaDEOOl). 

In H8.5I we will present inversions of AE to the in- 
tegrated excess mass. These mass profiles are model- 
independent, but as with the light measurements pre- 
sented in W2.21 only the excess above the mean can be 
measured with lensing. 

2.2. Cluster-light Correlations 

We used the method of iMasjedi et all (|2006( l to esti- 
mate the mean number density and luminosity density 
of galaxies around clusters. This method is essentially a 
correlation function with units of density: corrections are 
made for random pairs along the line of sight as well as 
pairs missed due to edges and holes. We defined two sam- 
ples: The primary sample denoted p and the secondary 



sample denoted s. In our case, the primaries were galaxy 
clusters with redshift estimates and the secondaries were 
the imaging sample with no redshift information, but in 
what follows we will use a more general notation. For 
example, the counts of real data secondaries around real 
data primaries is denoted D p D s , while the counts of real 
secondaries around random primaries is R p D s . If instead 
of counting "1" for each object, we count some other 
quantity such as the luminosity of the secondary, we say 
we have weighted the measurement by luminosity. 

The mean luminosity density of secondaries around pri- 
maries is 



£ w 



DpDs 
DpR s 



R p D s 
RpR s 



(4) 



where £ is the mean luminosity density of the secondary 
sample, averaged over the redshift distribution of the pri- 
maries, and w is the pr ojected correla ti on fun ction. This 
is the estimator from Masjedi et al.l (|2006f ) where the 
weight of each primary-secondary pair is the luminosity 
of the secondary. We have written the measurement as 
£w to illustrate that the estimator gives the mean density 
of the secondaries times the projected correlation func- 
tion w. Only the excess luminosity density with respect 
to the mean can be measured. Using a weight of unity 
gives the number density. 

The first term in equation [4] estimates the total lumi- 
nosity density around the clusters including everything 
from the secondary imaging sample projected along the 
line of sight, and the second term corrects for the random 
pairs along the line of sight. Note, the same secondary 
may be counted around multiple primaries (or random 
primaries). 

The numerator of the first term in equation 21 D p D s , 
is calculated as: 



D P D S 



T "'-' L ' S ={L pair ) + {L* ir ) = fA £ + 



N.„ 



(5) 

where the sum is over all pairs of primaries and sec- 
ondaries, weighted by the luminosity of the secondary. 
The secondary luminosity is calculated by K-correcting 
each secondary galaxy's flux assuming it is at the same 
redshift as the primary (see H3.4I for details of the K- 
corrections). The total luminosity is the sum over corre- 
lated pairs (Lp a i r ) as well as random pairs along the line 
of sight {Lpair)- By the definition of w, this is the total 
luminosity per primary times w + 1. This can be rewrit- 
ten in terms of the luminosity density of the secondaries 
£ times the area probed A. Some fraction of the area 
searched around the lenses is empty of secondary galax- 
ies due to survey edges and holes. The factor / represents 
the mean fraction of area around each primary actually 
covered by the secondary catalog. It is a function of pair 
separation, with a mean value close to 1 on small scales 
but then dropping rapidly at large scales. 

Again, a single secondary may be counted around mul- 
tiple primaries and K-corrected to different redshifts. 
Statistically they will only contribute when paired with a 
physically associated object due to the background sub- 
traction described below. In fact most of the calculations 
involved in this measurement are for pairs that are not 
physically associated, which is part of the reason this is 
computationally difficult. 
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The denominator of the first term in Equation [4] cal- 
culates the factor fA, the actual area probed around the 
primaries. This term in the denominator corrects for the 
effects of edges and holes. Also, because it has units of 
area, we recover the density rather than just the correla- 
tion function. 



D p R s 



N DR 

pair 



p\dA) 



dN 

dn 



fA. 



(G) 



The numerator is the pair counts between primaries and 
random secondaries, and the denominator is the expected 
density of pairs averaged over the redshift distribution of 
the primaries, times the number of primaries. The ratio 
is the actual mean area used around each primary fA. 

The second term in equation [4] accounts for the ran- 
dom pairs along the line of sight. The numerator and 
denominator are calculated the same way as the first 
term in equation [4j but with fake primaries distributed 
randomly over the survey geometry. The redshifts are 
chosen such that the distribution of redshifts smoothed 
in bins of Az = 0.01 match that of the clusters. 



R P D S 



2—ipr \ dAJ 



= f A l (7) 
= f R A R . (8) 



pair 



N R % 

jpr \ dAJ pr d(l 



The ratio of these two terms, R P D S / RpR s , calculates the 
mean density of the secondaries after correcting for the 
survey geometry. 

The density measured with this technique could be tab- 
ulated in various ways, typically as a function of pro- 
jected radial separation R. We tabulate in a cube which 
represents bins of separation R, luminosity L, and color 
g — r. This facilitates the study of the radial dependence 
of the luminosity function and the color-density relation. 

These profiles can be inverted to the three-dimensional 
excess density and integrated to get the total excess light. 
We will present this formalism in section l8Tl 

3. DATA 

The d ata used for tensing was described in detail in 
paper I (ISheldon et alJ l2007fl and the cluster sample is 
described in iKoester et al.l (l2007blfah w ith the modifica- 
tions detailed in ISheldon et all (|2007f) . We will briefly 
describe relevant details of the cluster sample and give 
a full description of the galaxies used as light tracers in 
the cluster-light cross-correlation measurements. All the 
primary data in this s tudy come from the Sloan Digi- 
tal Sky Survey (SPSS: lYork et al.l l200l data release 4 
(jAdelman-McCarthv et al.ll2006fh 



3.1. Cluster Sample 

Full d etails of the c l uster fi nder and catalog can be 
found in IKoester et al.l (2007b. a). The cluster finder is a 
red-sequence method, limited to the redshift range 0.1- 
0.3. The basic galaxy count ./V200 is the number of galax- 
ies on the red sequence with rest-frame i-band luminosity 
L >0.4L* within T^qq ■ The i-ba nd used in the cluste r 
finder is the z = 0.1 value from iBlanton et all (|2003bf ). 
corresponding to M* - 5log 10 h = -20.82 ± 0.02. The 
radius r^^ is determined from the size-richness relation 



presented in lHansen et alJ (|2005D . Note, this relation 
gives roughly a factor of 2 larger radius than the rV%$\ ss 
determ ined from the mass in Paper II (|Johnston et al.l 
2007b) (sec JTjfor details about the various richness mea- 
sures) . The published catalog contains clusters with ./V200 
> 10, and we augment this catalog with ^200 > 3 ob- 
jects. The cluster photometric redshifts are accurate to 
0.004 over our redshift range with a scatter of Az ~ 0.01 
for 7V200 > 10; the scatter degrades to Az ~ 0.02 for 
N200 > 3, with the same accuracy. 

3.2. Galaxy Sample 

For the cluster-light cross-correlations we separated 
galaxies from stars usin g the B ayesian techniques devel- 
oped in [Scrantoneian HH) . The primary source of 
confusion in star-galaxy separation at faint magnitudes 
is shot noise. Stars scatter out of the stellar locus and 
galaxies scatter into the stellar locus. This technique 
uses knowledge of the true size distribution of stars and 
galaxies as a function of apparent magnitude to assign 
each object a probability of being a galaxy. 

We characterized the distribution of star and galaxy 
sizes as a function of magnitude and seeing using the 
deeper southern SDSS stripe. The SDSS southern stripe 
has a been repeatedly scanned. We chose regions of sky 
which were scanned at least 20 times and chose the 20 
best-seeing observations for each object. We then simply 
added the flux at the catalog level to increase the S/N. 
Thus, the selection is close to that of single scans but 
with a better measurement; S/N at a given magnitude 
is higher, and the distribution of measured sizes is closer 
to the truth. For a range of seeing values, we then cal- 
culated the probability that an object in a single-epoch 
image with a given magnitude and size is truly a galaxy, 
and applied this to all objects in the survey. The result- 
ing distribution is highly peaked at probability and 1, 
such that our chosen probability cut at p > 0.8 results 
in a sample >99% pure within our magnitude limit. 

We chose to K-correct to band-passes shifted to the 
mean cluster redshift 0.25 rather than redshift to mini- 
mize the corrections. We will refer to all such magnitudes 
with a superscript, e.g. 25 z. For a full discussi on of the 
band-shifting process see IBlanton et alJ (|2003ah . 

We chose a volume and magnitude limited galaxy sam- 
ple for Lo.2 H >10 95 h~ 2 L Q (orMo. 25i -51og 10 /i<-19.08). 
and z < 0.3. This corresponds to an apparent magnitude 
limit of i < 21.3, and color limits of g—r < 2 and r— i < 1. 
All magnitudes are SDSS model magnitudes. 

3.3. Survey Geometry 

We characterized the survey geometry using the 
SDSSPix code 13 . This code represents the survey using 
nearly equal area pixels, including edges and holes from 
missing fields and "bad" areas near bright stars. We re- 
moved areas with ex tinction greate r than 0.2 magnitudes 
in the dust maps of ISchlegel et alJ ([1998). This window 
function was used in the cluster finding and in defining 
the galaxy catalog for the cross correlations. By includ- 
ing objects only from within the window, and generating 
random catalogs in the same regions, we controlled and 
corrected for edges and holes in the observed counts as 
described in Sj2] 

13 http:/ /lahmu. phyast.pitt.edu/~scranton/SDSSPix/ 
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3.4. K-corrections 

We calculate d K-corrections using the template code 
kcorrect from iBlanton et all (|2003aj ). This code is ac- 
curate but too slow to calculate the K-corrections for the 
billions of pairs found in the cross-correlations. To save 
time we computed the K-correction on a grid of colors in 
advance. We took galaxies from the SDSS Main sample 
as representative of all galaxy types. We then computed 
their K-corrections on a grid of redshifts between and 
0.3, the largest redshift considered for clusters in this 
study. The mean K-correction ina21x21x80 grid of 
observed g—r, r—i, and z was saved. We interpolated this 
cube when calculating the K-correction for each neigh- 
boring galaxy. This interpolation makes the calculation 
computationally feasible for this study, but is still the 
bottleneck. 

4. CLUSTER GALAXY POPULATION MEASUREMENTS 

4.1. Radial, Color and Luminosity Binning 

Using the estimator presented in §2.2[ we measured 
the number and luminosity density of galaxies around 
the clusters as a function of radius R from the BCG, 
color °' 25 (<7 — r), and luminosity Lo.25j. The bins in ra- 
dius, color, and luminosity form a data cube with 18 
bins in radius, 20 bins in color and 20 bins in luminosity. 
The range in each variable is 0.02 < R < W.bh^ 1 Mpc, 
< 25 ( 3 - r)< 2 and 9.5 <\og w (Lo.2 H / L Q ) < 11.7. 
The initial cube before background correction was kept 
separately for each cluster to allow flexibility when later 
estimating the average profile. Two versions of this cube 
were kept, one with number counts and another with 
the total luminosity. In other words, in the first case 
we counted each galaxy and in the other we counted the 
luminosity. Another set of measurements was also per- 
formed to i0h~ l Mpc, but binning only in radial separa- 
tion to save resources. More detailed analy sis of the full 
cube will presented in lHansen et al.1 (|2007r ): here we will 
present what is needed for the particulars of the M/L 
measurement. 

4.2. Random Catalogs 

We generated random catalogs uniformly over the sur- 
vey area using the window function described in ^3.3t 
We chose the redshift distribution to be that of constant 
density in comoving volumes over the redshift range of 
the clusters. We matched subsets of these redshifts to 
the redshift distribution of each cluster subsample as de- 
scribed in §4.31 

We performed the same galaxy counting as described 
in ij4.ll for a set of 15 million random points in order to 
correct for the random background. These are used in the 
RD term from the estimator described in $2] (Equations 
[4] and [7]). We also ran sets of 15 million random points 
for the DR and RR terms. 

4.3. Histogram Matching 

The redshifts used for random points must statisti- 
cally match that of each cluster sample in order for 
the background subtraction to be accurate. The ran- 
dom primaries described in £14.21 were generated with 
constant comoving density. We drew random redshifts 
from this sample such that the distribution matched 
that of the cluster sample when binned with Az = 
0.01. 
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Fig. 1. — Mean fractional area searched relative to the total area 
in each radial bin for the 8th cluster richness sample (12 <-/V200< 
17). Edges and holes dilute the true galaxy counts, biasing the 
measured density. This effect is negligible on small scales, but 
becomes important for large separations when a higher fraction of 
clusters are closer to the edge than the search radius. Due to small 
area at small separations, the value is not well determined, but 
must approach unity smoothly. We model this with a polynomial 
constrained to be unity on small scales, shown as the solid curve. 
For larger scales no model is needed. 



4.4. Cluster Richness Binning 

Because the measurements before background correc- 
tion were saved for each cluster separately, the clusters 
could be binned at a later time to produce mean den- 
sity profiles. For this work we binned the clusters into 
12 bins of A200 and 16 bins of i-band cluster luminosity 
£200 > where the luminosity is that of the -/V200 red galax- 
ies counted within r^oo as described in §3.11 These bins 
were chosen to correspon d with the binning p resented 
in the lensing analysis of ISheldon et al.l (|2007f ). Some 
statistics of this sample are shown in Tables [T] and 

4.5. Corrections for Edges and Holes 

The terms D p R s and R P R S introduced in W2. 21 correct 
for the survey edges and holes by measuring the actual 
area searched. An example DR is shown in Figure[TJ gen- 
erated for one of the richness bins described in H4.4I This 
is expressed as the mean fractional area searched relative 
to the area in the bin. For small separations, edges and 
holes make little difference so the fractional area is close 
to unity. On larger scales edges are important. 

On small scales the number of pairs in each bin is rel- 
atively small so the correction is not as well constrained. 
However, we know that the fractional area must approach 
unity smoothly, and this can be seen from visual inspec- 
tion. To smooth the result, we fit a fifth order poly- 
nomial, constrained to be unity on small scales, to the 
fractional area as a function of the logarithmic separa- 
tion. Due to the weighting, this results in a curve that 
approaches unity smoothly on small scales, yet matches 
intermediate separation points exactly. Points on larger 
scales are well-constrained and do not need smooth- 
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TABLE 1 
AM/AL Statistics for Afaoo Bins 



AL 



2(111 



(Af 2 oo> AL200 AM200 

3 3.84 ±0.02 3.99 ±0.16 

4 4.86 ±0.04 5.78 ±0.26 

4 5.98 ± 0.07 8.84 ± 0.44 

5 7.22 ± 0.09 12.88 ± 0.64 

7 7.99 ±0.12 12.54 ±0.78 

8 9.16 ±0.15 16.58 ±1.20 

9 11.51 ±0.14 26.88 ±1.03 
13 16.23 ±0.22 42.67 ±1.39 
20 23.74 ±0.38 70.18 ±2.54 
31 36.88 ±0.80 131.66 ±5.02 
50 53.82 ± 1.95 203.71 ± 9.57 379 ± 43 
92 104.33 ±5.16 516.19 ±38.78 489 ± 75 



101 ± 10 
108 ± 14 
166 ± 18 
170 ± 22 
122 ± 27 
179 ± 30 
229 ± 22 
257 ± 21 
319 ± 27 
333 ± 32 



I AM_\ 

V / rmax 

275 ±313 
347 ± 229 
455 ± 234 
341 ± 183 
540 ± 231 

353 ± 157 

354 ± 140 
327 ± 133 
321 ± 145 
300 ± 175 
525 ± 307 
547 ± 332 



{L<> al ) 

0.98 ±0.10 
1.02 ±0.10 
1.02 ±0.11 
0.93 ±0.12 
1.10±0.11 
1.10 ±0.11 
1.04 ±0.10 
1.08 ±0.09 
1.24 ±0.11 
1.27 ±0.13 
0.94 ±0.24 
0.93 ±0.53 



ri/2 

0.25 ± 0.07 
0.44 ±0.15 
0.47 ±0.16 
0.24 ± 0.05 
0.30 ± 0.31 
0.44 ±0.12 
0.30 ± 0.05 
0.28 ± 0.05 
0.19 ± 0.03 
0.20 ± 0.04 
0.15 ± 0.02 
0.11 ± 0.01 



a 

1.34 ±0.11 

1.05 ± 0.08 

1.06 ±0.08 
1.46 ±0.13 
1.02 ±0.14 
1.16 ± 0.11 
1.27 ± 0.09 
1.12 ± 0.08 
1.37 ±0.11 

1.35 ±0.16 
2.19 ±0.29 
3.91 ± 0.65 



AM 
^ / asyn 

226 ± 33 
306 ± 42 
357 ± 44 
285 ± 28 
271 ± 59 
378 ± 41 
342 ± 24 
355 ± 24 
349 ± 21 
394 ± 26 
404 ± 21 
485 ± 33 



Note. — Mass-to-light ratio statistics for clusters binned by richness A^oo- AL is the excess light over the mean luminosity density 
of the universe, and AM is the excess mass over the mean mass density. Subscripts 200 and rmax indicate quantities within T200 an d 
the maximum radius 22/i~ 1 Mpc, respectively. The subscript asym refers to an asymptotic value from fitting the model described in 
the text. Other parameters of this fit are ri/2 an d the half-light radius and power law index. The model is not a physical model, so 
these uncertainties should be considered lower limits. L 9al is the mean luminosity of the neighboring galaxies used to calculate AL. 
No attempt was made to account for possible offsets between the BCGs used as centers and the halo mass peak; this would affect the 
AM200 but should have little effect on (AA-f/AL)200i an d no effect on (AM/AL)22Mpc- Masses are in units of /i~ 1 ALq, luminosities 
in units of h~ 2 10 10 Lq and mass-to-light ratios in units of /iMq/Lq. r\/i is in units of /t -1 Mpc. The mean richness is shown but the 



; can be found in lSheldon et al.l 1 120071) 



TABLE 2 

AM/AL Statistics for L 2 oo Bins 



(£200} 



AL 2 



AM 2l 



AM 



211" 



5.6 
7.0 

8.7 

10.8 

13.5 

16.9 

21.1 

26.3 

32.9 

40.9 

51.2 

64.1 

79.9 

98.7 

124.6 

184.7 



3.75 ± 0.04 
4.63 ± 0.04 
5.98 ±0.06 
7.62 ± 0.07 
9.48 ±0.10 
11.82 ± 0.13 
14.12 ±0.19 
17.77 ±0.27 
21.75 ±0.35 
26.39 ±0.56 
33.96 ± 0.84 
45.66 ± 1.23 
51.09 ± 1.94 
56.53 ±3.12 
81.86 ±5.88 
123.86 ± 9.07 



4.51 ±0.30 
5.32 ± 0.32 
7.64 ± 0.40 
11.80 ±0.53 
15.68 ±0.70 
23.68 ± 0.94 
29.34 ± 1.26 
41.87 ± 1.76 
54.22 ± 2.46 
72.34 ± 3.31 
105.56 ±5.22 
162.02 ± 8.44 
172.42 ± 10.72 
194.84 ± 15.52 
266.15 ±23.66 
593.94 ± 44.55 



114 ±20 
113 ± 18 
116 ± 18 
169 ± 18 
129 ± 19 
193 ± 19 

214 ± 23 

215 ± 25 
265 ± 27 
279 ± 32 
284 ± 39 
354 ± 42 
339 ± 52 
291 ± 74 
370 ± 70 
476 ± 72 



!am\ 

V / rmax 


(L!> al ) 


n/2 




a 




363 ± 263 


1.04 


± 





.11 


0.70 


± 


0.50 


0.94 


± 


0.08 


260 ± 238 


1.03 


± 





.11 


0.39 


± 


0.19 


1.15 


± 


0.11 


458 ± 213 


1.01 


± 





.11 


0.18 


± 


0.05 


1.44 


± 


0.15 


283 ± 146 


1.11 


± 





.12 


0.42 


± 


0.12 


1.09 


± 


0.08 


405 ± 187 


1.05 


± 





.11 


0.52 


± 


0.13 


1.06 


± 


0.06 


374 ± 173 


1.11 


± 





.11 


0.52 


± 


0.15 


0.98 


± 


0.07 


526 ± 184 


1.16 


± 





.11 


0.33 


± 


0.12 


1.08 


± 


0.10 


332 ± 137 


1.15 


± 





.11 


0.33 


± 


0.09 


1.13 


± 


0.10 


408 ± 223 


1.02 


± 





.12 


0.21 


± 


0.05 


1.33 


± 


0.13 


315 ± 159 


1.30 


± 





.15 


0.22 


± 


0.05 


1.28 


± 


0.13 


237 ±218 


1.24 


± 





.16 


0.24 


± 


0.09 


1.15 


± 


0.15 


225 ± 224 


1.20 


± 





.21 


0.32 


± 


0.14 


1.17 


± 


0.15 


347 ± 301 


1.28 


± 





,16 


0.18 


± 


0.03 


1.65 


± 


0.25 


384 ± 286 


1.11 


± 





.61 


0.12 


± 


0.04 


2.53 


± 


0.63 


-286 ± 474 


1.12 


± 





.67 


0.15 


± 


0.03 


5.67 


± 


2.66 


626 ± 327 


0.79 


± 





.54 


0.24 


± 


0.05 


1.49 


± 


0.16 



AM ' 
. AL , 



465 ± 96 
300 ± 56 
200 ± 25 
315 ± 36 
357 ± 38 
399 ± 42 
319 ± 36 
357 ± 35 
329 ± 28 
343 ± 27 
368 ± 41 
451 ± 56 
366 ± 29 
321 ± 29 
324 ± 38 
456 ± 40 



Note. 



Same as tabic ^ for clusters binned by L200- 



mg. 



5. LUMINOSITY DENSITY AT Z=0.25 



We are interested in the background luminosity den- 
sity at z=0.25, in the °- 25 i band, for comparisons with our 
luminosity measurements. We use an evolved version of 
the SDSS spectroscopic sample (whose median redshift is 
about z ~ 0.1). For this purpose, we use the DR4 version 
of the New Yor k University Value-A dded Galaxy Catalog 
fNYU-VAGC: IBlanton et all 120051) . We select a subset 
of the galaxies in the redshift range 0.01 < z < 0.25, in 
the apparent magnitude range 14.5 < m r < 17.6. The 
NYU-VAGC provides the angular completeness map nec- 
essary to calculate for each galaxy the quantity 7 ma x, the 
maximum volume over which each galaxy could be ob- 
served. We do so using the same method as used by 
IBlanton! (|2006h . which accounts for the evolution and 
X-corrections within th is redshift range. We u se the 
kcorrect v4_l_4 code of lBlanton fc Roweii (|2007ft to es- 
timate the 25 i band, Galactic extinction-corrected, K- 



corrected absolute magnitude of each galaxy, based on 
the model fluxes provided by SDSS. 

Finally, we evolution-correct thes e magnitudes i n the 
following way, based on the results of lBlantonl (2006) . For 
each galaxy we apply a simple correction of the form: 

Mo.2 H (z = 0.25) = Mo. 2H (z) + A(z - 0.25) (9) 
For galaxies on the blue sequence, based on their °' 25 (g — 
r) colors, we use A — 0.65. For galaxies on the red se- 
quence we use A = 0.35. The red-blue split is defined by 
the line °- 25 ( 3 -r) = 1.2-0.05(A/o.25 r + 20). These values 
are calculated by evaluating th e simple star-formation 
history models of lBlantonl (|2006f) . These models explain 
the evolution of the blue and red sequences in the 01 g 
band well, and we use the corresponding predictions for 
the °- 25 i band. In practice, these corrections are quite 
small (at most 0.16 mag), and so the inevitable uncer- 
tainty in this correction is likely to be unimportant. 

By using the evolution-corrected magnitudes and 
weighting each galaxy by 1/Vmax, we estimate the lu - 
minosity function using the method of lSchmidtl ([1968). 
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The resulting luminosity density for galaxies above our 
luminosity threshold l0 9 - 5 h~ 2 L Q is (1.61 ± 0.05) x 10 s h 
L Q Mpc -3 in comoving coordinates, or (3.14±0.10) x 10 8 
h LqMpc -3 in physical coordinates. Our luminosity 
limit is equivalent to — 5 log 10 h < -19.08, where we 

use Mq = 4.67 in the °' 25 i band. The uncertainty is dom- 
inated by the absolute calibration of the SDSS. Fitting 
a single Schechter function to the luminosity function, 
we find that M* - 51og 10 h ps -20.9 (with a « -1.21), 
equivalent to L* = 1.7 x 10 10 /i _2 i Q . Thus the lower 
luminosity threshold corresponds to 0.19L*. 

6. SIMULATIONS 

In order to study the impact of the MaxBCG al- 
gorithm on our conclusions, and in particular the dif- 
ferences between how our method operates on dark 
matter halos and on MaxBCG clusters, we have re- 
peated the luminosity measurements on a mock cata- 
log. These catalogs, which have been used in previ- 
ous M a xBCG studies (iRozo et al1l2007bl lal: iKoester etaLl 
l2007bt Uohnston et al.ll2007b[ ). populate a dark matter 
simulation with gal axies using the ADDG ALS technique, 
to be described in IWechsler et al.l ([2008). This method 
is designed to populate large volume simulations with 
galaxies that have realistic luminosities, colors, cluster- 
ing properties, and galaxy clusters. 

The catalog is based on the light-cone f rom the Hub- 
ble Volume simulation (|Evrard et al.ll2002T) . and extends 
from < z < 0.34. Galaxies are assigned directly to dark 
matter particles in the simulation, with a luminosity- 
dependent bias scheme that is tuned to match local clus- 
tering data. First galaxy luminosities are generated in 
the z=0.1 shifted r-band , draw n from the luminosity 
function of Blanton ct al. (2003bj). The luminosity func- 
tion is assumed to evolve passively, with 1.3 magnitudes 
of evolution in M* per unit redshift (M*(z) = M*(z = 
0.1) — 1.3(z — 0.1)). Particles in the simulation are then 
assigned these luminosities based upon the following pre- 
scription. 

We measure the local mass density around each dark 
matter particle, defined here as the radius enclosing a 
mass scale of ~ 1 x 10 13 M Q . For sets of points binned 
by local mass density, we measure the auto-correlation 
function and the distance to the 5th nearest neighbor 
with the same local density. From galaxy surveys we 
know the correlation function as it depends on galaxy 
luminosity, so by finding the set of particles with a cor- 
relation function that matches that of galaxies in the 
real universe, we make a connection between the local 
mass density in the simulation and galaxy luminosity. 
We use this to parameterize the probability distribution 
of these dark matter densities as a function of luminos- 
ity, and constrain these parameters so that the resulting 
luminosity-dependent two-point clustering properties of 
the mock gal axies are in agreem ent with those measured 
in the SDSS (jZehavi et allhoOSl ). 

Once placed on a dark matter particle according to this 
prescription, each mock galaxy is then assigned to a real 
SDSS galaxy that has approximately the same luminosity 
and local galaxy density, measured here as the distance 
to the 5th nearest neighbor. The color for each mock 
galaxy is then given by the SED of this matched galaxy 
transformed to the appropriate redshift. The matching 
of local galaxy density helps to ensure the relationship 



between color and density is preserved. 

This procedure produces a catalog which matches sev- 
eral statistics of the observed SDSS population, includ- 
ing the location, width and evolution of the ridgeline in 
color-luminosity characteristic of galaxy clusters. The 
luminosity limit for galaxies in these mocks is slightly 
lower that of the MaxBCG cluster finder L >0.4L*, so 
the catalogs are well-designed for testing the MaxBCG 
algorithm. However, this limit is higher than for our lu- 
minosity measurements, so the results cannot compared 
at low luminosities. Therefore, in this paper the simula- 
tions are used strictly to understand the effects of cluster 
selection on our measurements (see ^8.21) . 

7. NOTATION 

The notation may get confusing due to the use of 
multiple methods and apertures in the course of clus- 
ter finding and lensing measurements. The basic nota- 
tio n for cluster variables , introduced in £13.11 is the same 
as ISheldon et al.l ([2007) : the measures of richness and 
luminosity we will refer to as A200 and £200- These 
are the counts and i-band luminosity for galaxies with 
Li > 0.4L„, colors consistent with the cluster ridge-line, 
and projected separation less than rfoQ S as calibrated in 
lHansen et al.l (|2005| ). Note, r^QQ was only used for A^oo 
and L20O; no other quantities in this paper use that aper- 
ture . For more informati on about the richness measures 
see (jKoester et al.|[20"07bl[a| ) . This radius is different from 
the radius r r {$f s calcul ated from the mass pro file, which 
is typically half as big ([Johnston et al.l 1200711 ): this dif- 
ferenc e is primarily due to diffe rences in convention be - 
tween lHansen et al.l ([2005D and Uohnston et al.l (|2007bf) : 
Hansen used projected over-densities relative to the mean 
luminosity density and Johnston used 3D over-densities 
relative to the critical mass density. The -L200 is only 
used for binning the clusters; because our results are es- 
sentially the same for L200 and A^oo binning we will avoid 
using the L200 notation except where necessary. 

We will refer to the total excess luminosity measured 
below, which includes the light of all types of galaxies 
above a luminosity threshold, as AL(r). This luminos- 
ity, and excess mass AM(r), are the new measurements 
presented in this paper. The total excess mass and light 
within rV^ ss are denoted AAf 2 oo and Ai 2 oo- Projected 
2D radii are referred to as R and 3D radii are referred to 
as r. 

8. RESULTS 

In the following sections we show the results for clusters 
binned by A200 ■ Similar results were obtained binning by 
-L20O; and we summarize all results in Tables Q] and [2l but 
for the sake of brevity we include plots only for the A^oo 
binning. 

8.1. The Radial Dependence of the Joint 
Luminosity-color function 

Figure [2] shows the joint color-luminosity distribution 
function for each radial bin in the 8th cluster richness 
sample (12 f^A^oof; 17). Similar distributions were cre- 
ated for each of the cluster richness bins used in this 
study. We will presen t deta iled analyses of these type of 
data in lHansen et al.l ([2007), but we also present a sam- 
ple here in order to demonstrate a few aspects important 
for the M/L study. 
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The first point is that the population is quite different 
at small scales relative to large scales. On small scales, 
near the BCGs, the galaxy population is dominated by 
red galaxies, while on large scales the color distribution 
looks more like the cosmological average. Similarly, on 
large scales the luminosity function looks more like that 
of the average, although they are poor fits to Schechter 
functions on large scales because we measure the ratio 
of correlation functions (see the following sections for 
more details). These facts are relevant to the M/L for 
a few reasons. We want to make sure that the popu- 
lation we are seeing around clusters makes sense; that 
we, for example, are complete in the color-luminosity 
ra nge we have probed. Again, detailed analysis will come 
in lHansen et al.l (|2007f) , but these color and luminosity 
trends are exactly what is expected. We will discuss the 
shape of the luminosity function in H8.2\ 

8.2. The Radial Dependence of the Luminosity function 

Projecting the two-dimensional color-luminosity plots 
from Figure [5] across the color axis produces the lumi- 
nosity function in each radial bin. Recall that to pro- 
duce these curves we have statistically subtracted the 
background, so the luminosity function is the luminosity 
function of galaxies minus that mean density of the uni- 
verse. The value of these functions in a luminosity bin i 
is 

(j> c ,i{R) = n(R)i - n t = faw^R), (10) 

where n is the number density, w Ci i(R) is the projected 
cross-correlation function between clusters and galaxies 
in luminosity bin i at projected radius R and 4>i is the 
value of the luminosity function of the universe in that 
luminosity bin i. 

Because of the statistical subtraction, the value of the 
luminosity function in luminosity bin i is weighted by 
the cross-correlation function of clusters with galaxies of 
that luminosity. This is important because galaxies of 
different luminosities correlate with clusters differently. 
An extreme example of this is demonstrated for the low- 
est L200 bin in Figure [3j which shows <fi c ,i in each radial 
bin. This Figure shows that, near the virial radius, the 
inferred luminosity function for small groups is actually 
negative for galaxies with 10 10 7 L/L Q . This is 

not because there is a negative number density of these 
galaxies around the groups, but because they are anti- 
correlated with the groups at this separation. This means 
that, near the virial radius, there are fewer of these high 
luminosity galaxies relative to the background. 

This effect is strongest in our lowest N200 bin, although 
there is a slight feature in the luminosity function at virial 
radius for high luminosity galaxies the first few cluster 
bins. 

In order to understand whether these effects are phys- 
ical, due to selection effects of the cluster finder, or are 
artifacts of our method, we ran the MaxBCG algorithm 
and our cross-correlation code on the simulation-based 
mock catalogs described in <|6] We performed this test 
twice, stacking on both cluster centers (BCGs) and halo 
centers. For the mock stacked on maxBCG clusters, a 
similar effect is seen, although it is suppressed relative 
to the effect in the real data. As in the data, the ef- 
fect is strongest in the lowest richness bin. When the 
measurement is done around halo centers, there is no 



significant effect seen. This indicates that it is mostly an 
effect introduced by the selection criteria of the MaxBCG 
algorithm. 

This lowest luminosity bin is peculiar in that it requires 
the close proximity of only three very luminous red galax- 
ies, one of which has extreme BCG-like luminosity. This 
an unusual situation; BCGs of this luminosity are usually 
surrounded by many more galaxies. In order to find only 
two such galaxies within a few hundred kpc of a BCG 
type galaxy, the algorithm selects objects embedded in 
slight under-densities. This can occur naturally in the 
MaxBCG algorithm due to the percolation step, which 
does not allow clusters to be embedded within larger 
clusters. This may limit low richness systems to very 
particular regions of space. 

8.3. Radial Luminosity and Number Density 

Further integrating the luminosity functions from sec- 
tion 18.21 across the luminosity axis results in the excess 
number density of galaxies. We have also generated lu- 
minosity density profiles using the luminosity weighted 
data cube rather than the cube of galaxy counts. These 
are shown in Figures 2] and [5] respectively. Each curve 
represents the excess density for each of the cluster L 2 oo 
bins. The errors come from jackknife re-sampling of the 
data following the sa me techniques used f or the lensing 
analysis described in ISheldon et al.1 (|2007l ) . We will say 
more about errors in section [9l 

Again, it is important to remember that these curves 
are background subtracted, and are thus the number and 
luminosity densities above the mean. These terms can be 
written in terms of correlation functions: 

n(R)-n = nw c , g (R) (11) 

t(R)-i=iw c , £ (R) (12) 

where n and £ are the mean number and luminosity den- 
sity of galaxies over the explored luminosity range, and 
w C: g(R) and w Ci i(R) are the projected cluster-galaxy and 
cluster-luminosity cross-correlation functions, averaged 
over all galaxies in the luminosity range at projected ra- 
dius -R, weighted by the luminosity function. 

It is tempting to interpret the ratio of the excess lumi- 
nosity to excess number densities as the mean luminosity 
as a function of radius. Given that there can be negative 
densities with respect to the mean at some luminosities 
when there are anti-correlations, as shown in Figure [31 
this is not always the correct interpretation. 

The curves in figures |4] and [5] show a number of fea- 
tures expected for two-point correlation functions. In 
particular, there should be a transition radius between 
correlations within the halo and between different halos. 
The scale of this break should correspond to the size of 
the larges clusters in each bin. We present no detailed 
analysis here, but the radius of the break we see does in- 
crease with cluster richness as expected. On small scales 
the profile is consistent with a universal profile and on 
large scales transitions to that expected for halo -halo cor- 
relations. We found in I Johnston et a l. (2007bj) that the 
mass profiles of the clusters were good fits models of this 
form. In this paper we do not use any explicit modeling, 
preferring to focus on model independent measurements, 
but these rich data should provide constraints for mod- 
els of galaxy formation and evolution in a cosmological 
setting. 
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Fig. 2. — Joint galaxy 0,25 (g — r) and °- 25 i-band luminosity densities as a function of projected separation from BCGs for the 8th cluster 
richness bin (12 <./V200< 17). The luminosities are expressed in the z=0.25 shifted bandpass. Each frame corresponds to a different radial 
bin; the radius is indicated in the legend. The one dimensional distributions for color and luminosity are also shown as the solid histograms 
along the left and bottom axes. The luminosity distribution is expressed as log of the number density as a function of log luminosity; 
the color distribution is linear density as a function of color. At small separations red galaxies dominate while on large scales there is a 
bivariate color distribution similar to the cosmological average. A smaller fraction of galaxies is highly luminous at small separations as 
compared to large. 
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Fig. 3. — Excess luminosity functions for the first cluster richness bin 7V200= 3. This is the integral across the color axis of the joint 
densities such as those shown for another richness bin in Figure [2] As in that Figure, each panel is a radial bin with mean radius indicated 
in the legend. Each bin in the excess luminosity function is the amplitude of the projected cluster-galaxy cross-correlation function at that 
radial separation times the mean density. In some bins the galaxies are anti-correlated. This cluster richness bin w as chosen as the extreme 
example of these anti-correlations; they are smaller or non-existent in higher richness bins. As discussed in SI8.2I this feature is a result of 
the MaxBCG selection function at low richness. 
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Fig. 4. — Excess radial number density for each of the cluster 
-/V200 bins. These curves are the integral of the excess luminosity 
functions as shown in Figure [3] 
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Fig. 6. — Excess 3D °' 25 i-band luminosity profiles for each of the 
cluster -/V200 bins. The 2D profiles from Figure[5]were de-projected 
and integrated over radius. The dashed curves without error bars 
include the BCG luminosity, the solid lines with errors do not. The 
mean r^QQ SS , determined from the mass profiles in i|8.5l is marked 
with an asterisk in each curve. 
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Fig. 5. — Excess radial 25 j-band luminosity density for each of 
the cluster A^oo bins. These curves are the integral of luminosity- 
weighted luminosity functions. 



8.4. Integrated Luminosity Profiles 

We will use integrated luminosity profiles to compute 
the mean M/L within a given three-dimensional radius r. 
We invert the projected two-dimensional profile shown 
in Figure El us ing a standard Abel type inversion (e.g 
iPlummedlmit ): 



M{r) = £(r) 



7T 



dR- 



Vr 2 -R 2 ' 



(13) 



where we have re-used the notation for the excess lumi- 
nosity density from equation [TT] but replaced projected 
radius R with three-dimensional radius r. We have been 
explicit here in indicating we measure the density minus 
the mean £(r) — t. Thus the A£(r) is proportional to the 
cluster-light cross-correlation function: 



The assumption behind the inversion in equation [13] 
is that £(R) is the line of sight projection of a spher- 
ically symmetric three-dimensional function £(r). This 
follows from the isotropy of the universe as long as the 
cluster selection function does not select structures pref- 
erentially aligned relative to us. Results from simula- 
tions suggest such preferential alignment is not impor- 
tant for our cluster sample where the sample can be 
tested iV200> 10- A publicat ion on these simulations is 
forthcoming (|Hao et al.ll2008l ). 

Because the maximum separation we measure is 30ft." 1 
Mpc rather than infinity, we cannot accurately calculate 
A£(r) over the entire range. We lose the last point en- 
tirely, and the second to last point, at 22/i _1 Mpc, must 
be corrected slightly for the endpoint. We perform a 
power-law extrapolation of the profile and find this to be 
a 5% upward correction. 

We then integrate the excess luminosity density to ob- 
tain the total luminosity within radius r. Because the 
profile only extends inwards to 25ft.~ 1 kpc, we are miss- 
ing some light interior to this radius. This light is domi- 
nated by the BCG however, so we can add that compo- 
nent back in as the average BCG luminosity for a given 
cluster sample: 



AL(< r) 



(L 



BCG) 



drAnr 2 A£{r) 



(15) 



where we have again been explicit in defining our mea- 
sured quantity as AL(< r), the integrated excess lumi- 
nosity above the mean density. 

Figure O shows this quantity AL(< r) for the richness 
and luminosity cluster binning. There are two curves for 
each richness bin: one including the mean BCG luminos- 
ity (Lbcg) and one without. The light is dominated by 
the BCG on small scales. The luminosity within r 



is marked with an asterisk, 
the mass profiles (see £18. 5ft 



The 



mass 
200 



200 



is calculated from 



A£(r)=£U(r) 



(14) 
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8.5. Integrated Mass Profiles 

The cluster mass profiles wer e measured from the lens- 
ing measurements prese nted in lSheldon et al.1 (|2007l ) and 
Uohnston et al. (2007b). These measurements were per- 
formed for the same samples presented above. The basic 
lcnsing measurement is AS, which is a projected quan- 
tity: 

AT, = £(< R) - t(R) , (16) 

where £ is the projected surface mass density at radius R 
and E(< R) is the mean projected density within radius 
R. The subtraction in this equation is a manifestation of 
the mass sheet degeneracy. 

In Uohnston et all (|2007bD we inverted the projected 
profiles to the three-dimensiona l excess mass density us - 
ing the techniques presented in I Johnston et all (|2007a). 
This inversion is a procedure analogous to the inversion 
of the luminosity density presented in 



d£ 
dR 



dAT 
dR 



AT 
> 

' R 



1 f c 

Ap(r) = p(r) - p = - I 

7T Jr 



dR 



-dT/dR 



VIP 



(17) 



Again, the assumption is that the profiles are projec- 
tions of spherically symmetric three-dimensional func- 
tions. We can recover the total excess mass within radius 
r, including that within our innermost radius, because 
AE is itself a non-local measurement: 



AM(< r)=nR 2 AT{R) 



2tt 



dr rAp{r) 



R 2 



Vr 2 - R 2 



(r - s/r 2 - R 2 ) 



(18) 

As with the luminosity inversions, the last point must 
be thrown out and a 5% correction is applied to the sec- 
ond to last point. Figure [7] shows these excess mass pro- 
files for the A^oo binnin gs. The error bars ar e from jack- 
knife re-sampling. Sec Shel don et al.l (|2007| ) for details 
about the error estimates, and <|9]for more details about 
er rors in this work . These data are the same as presented 
in Uohnston et al.l (|2007bf ). We note that there could be 
a level of systematic error in these measurements, pri- 
marily from the photometric redshift errors on the back- 
ground source galaxies used for the shear measurement. 
It is difficult to know this error, but simulations suggest 
the calibration is good to 10%. 

In order to get a size scale for each cluster sample, we 
fit a simple model to the data. The model is that of an 
NFW profile on small scales (|Navarro et al.l fl997f ) and 
linear correlations on large scales. This m odel was pre- 
sented in detail in Johnsto n et al.l (|2007bl ). For this pa- 
per, we only use this fit in order to estimate a size rV^ ss , 
from which we can also calculate the mass AA^oo- The 
r 2 QQ SS is the radius where the mean mass density falls to 
200 times the critical density, and M 2 qq is the mass con- 



tained within that radius. This radius ', 



200 



will be a ref- 



erence point for the M/L measurements. The fits for each 
richness bin are shown in Figure [7] and rV^ ss is marked 
for reference. Note, these values are s omewhat different 
than those in Uohnston et al.l (|2007b| ) where power-law 
inter polation was used to ex tract rV^ ss . 

In Uohnston et alj (|2007b| ) it is shown that if a fair 
fraction of the BCGs are not centered on the peak of 



the halo mass distribution, the shape can be strongly 
effected. In that work an offset distribution was deter- 
mined from simulations, and used to recover the under- 
lying halo mass distribution. In this work we do not try 
to recover halo masses, but work directly with the ob- 
servations around the BCGs chosen as centers. Thus it 
is important to keep in mind that these are the mass 
profiles (and light profiles) around BCGs, not necessar- 
ily peaks of the halo mass distribution. This is probably 
not a large effect for the M/L at the virial radius, and is 
negligible on the largest scales. On the other hand, for 
relatively small radii the shape of the M/L profile around 
BCGs may be different than that around halo centers. 

8.6. Mass-to-light Ratio Profiles 

In order to generate AM/ AL profiles we simply divide 
the integrated excess mass AM by the integrated excess 
light AL profiles. These profiles are shown for each clus- 
ter richness bin in Figure [H 

The AM/ AL is shown with and without the mean 
BCG luminosity. When the mean BCG luminosity is in- 
cluded the profile rises steeply and then flattens out at 
large radius. The r r {$f s is marked for reference. The 
mean AM/ AL within rlj^f s is a strong function of clus- 
ter richness. However, the asymptotic AM/ AL is nearly 
independent of cluster richness. 

Without the mean BCG luminosity included, the 
AM/ AL is relatively flat on intermediate to large scales, 
indicating that the relative amount of mass and light on 
those scales is not a strong function of radius. There 
is a turn up at small scales, however. This is partly 
due to the fact that we do not measure light on scales 
less than 25/i~ 1 kpc. However, there is also light not 
counted in our luminosity measurements. It is known 
that there is a significant amount of intra-cluster light 
(ICL), li ght not associated with galaxies, in m any clus- 
ters (e.g iGonzalez. Zabludoff. & Zaritskvll2005j ). There 
is also missing light from galaxies below the luminos- 
ity threshold 10 9 5 ft.~ 2 L Q and the light not counted from 
the outskirts of detected galaxies (probably dominated 
by the outskirts of the BCG on these scales). Only by 
estimating this missing light will we know the true M/L 
profile on small scales, and the absolute M/L for all ex- 
cess light in these clusters. 

In order to more quantitively describe the shape of the 
AM I AL curves, we use a simple fitting function that 
captures the main features of our profiles. It is a function 
which would describe the ratio of two equal-index power 
laws in mass and light plus a delta function for the mean 
BCG luminosity. 



AM/AL(< r) 



M (r /Ih^MpcY 



(L BCG ) + Lair/lh-mpcY 
(r/r 1/2 r \ (AM\ 
l + {r/r l/2 y) \AL) as 



,(19) 



where ri/ 2 = ((Lbcg) / Lq) 1 ^ is the radius at which 
the AM/ AL reaches half its asymptotic value at infinity 
(AM/ AL) asyrn . A larger ri/ 2 at fixed a implies a larger 
fraction of the total light is in the BCG, which results in 
a slower transition to the asymptotic AM/ AL. 

Although we know the mass and light profiles are not 
pure power laws, they tend to deviate from a power law in 
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Fig. 7. — Cumulative 3D excess mass profiles for each cluster N200 bin derived from weak gravitational lensing. The solid curves show 
the best fit NFW+linear bias model. The dotted curve is the corresponding linear model and the dot-dashed curve is the NFW only. The 
asterisk marks r^Q SS . Note these models do not account for possible offsets between the BCG and the halo mass peak, which would alter 
the profile on small scales. 
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Fig. 8. — Excess mass to excess light ratio profiles for each of the iV200 bins. Light is measured in the °- 25 i bandpass. These curves are 
the ratio of the curves shown in Figures [6] and [7] The points with error bars include the mean BCG luminosity, while the dotted curves 
exclude the luminosity of the BCG. The asterisk marks r^QQ SS . The curve through the data is a simple descriptive model as discussed in 
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similar ways, which partially cancels this error. However, 
this means that a should not be interpreted as the slope 
of the mass or light profile. 

The best fits for equation [19] are over-plotted in Figure 
[8] for the A200 binning. We use the full covariance ma- 
trix of the AM/AL, generated from the covariance ma- 
trices of AM and AL, for the fits. Using the covariance 
matrix accounts for the strong correlations in the errors 
caused by the radial integration. The best-fit ri/2, a, and 
(AM/AL) asym are listed in Tables Q] and [2] There is a 
weak trend of decreasing r 1 / 2 with -/V200 ; while a is rela- 
tively constant except for the highest richness bins. Re- 
call that r 1 / 2 = ((Lbcg) /Lq) 1 ^, and thus a smaller ri/ 2 
at fixed a is indicative of a less dominant BCG relative 
to the overall cluster luminosity, which we had already 
seen to be true in Figure [SJ However, at high masses the 
value of a increases also, weakening the change in r 1 / 2 
somewhat. We will discuss the (AM/ AL) asym values in 
section [~ 



AM 200 [h" 1 M Q ] 



f :mass 
'200 



8.7. Mass-to-light Ratio Within rV^ ss 

In Figure [9] we show the mean AM/AL within r 2 ' 
(AAf/Ai)2oo, for each of the cluster richness and lumi- 
nosity bins, plotted as a function of the mean A^oo- The 
AM/AL increases strongly with -/V200 over two orders of 
magnitude in mass. The AM200 is also shown on the top 
axis, but note this is rough as the transformation is non- 
linear. The (AM/AL)2oo versus AM200 is a good fit to 
a power law with index 0.33 ± 0.02. The (AM/AL) 200 
vs. N200, however, is not a good fit to a power law due 
to the non-linear relationship between mass and galaxy 
counts. 

Over-pl otted in Figur e [9] is a prediction based on the 
models in I Tinker et al.l (|2005l ) for z — rest-frame i- 
band light rather than 25 z. The predicted A200 have 
been scaled by a factor of 1.5 to pass over the points. We 
do not expect this prediction to match our data, which is 
in a different band and for which there are BCG center- 
ing effects. The point here is to show a rough expected 
trend with mass. There is qualitative agreement with 
the mass scaling of this prediction and our data. Note, it 
is tempting to think our data do not asymptote at high 
mass as expected from the model, but there is actually 
agreement at the one sigma level. 

8.8. Asymptotic Mass-to-light Ratio 

The AM/AL profiles shown in Figure [8] rise quickly 
and flatten at large separations. We measure this asymp- 
totic value in two separate ways. First we use the last 
point in the AM/AL curve at r <22/i~ 1 Mpc, which we 
will refer to as (AM/AL) 22Mpc . The (AM/AL) 22 Mpc 
for each richness bin is shown in Figure [TTJ] Note, the 
value at 22/i _1 Mpc should be insensitive to any offsets 
between the BCGs and the true halo mass peak. We 
detect no trend in the (AM/AL)22Mpc as a function of 
A^oo- For the average value we get 362 ± 54/i for the rich- 
ness binning and 349 ± 51h for the luminosity binning. 
Note these are not independent measures since the same 
clusters are used for both binnings. 

For the second method we examine the (AM / AL) asym 
values measured from the fits in £18.61 which are listed 
in Tables □ and [2] As with the (AM/AL) 22 Mpc val- 
ues, they are roughly constant with A^oo- The er- 
rors on (AM / AL) asym are much smaller than those on 
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Light is measured in the °- 25 i bandpass. The mean AAf200 within 
r 200 SS cen tered on BCGs is shown on the top axis, although this is 
only rough since the transformation between N200 an d mass is non- 
linear. No correction for offsets between the BCGs and the halo 
mass peaks is included. The (AAf/AL)200 vs AM200 is a good fit 
to a power law with index 0.33 ± 0.02; the (AM/AL)2oo vs -^200 
is not a good power law. The over-plotted cu rve is a prediction 
based on the simulations of Tinker ct al. (2005). This curve is not 
correctly matched to this data, but is rather intended to show the 
rough expected trend with mass. 
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Fig. 10. — Asymptotic excess mass-to-light ratio as a function 
of 7V200- Light is measured in the °- 25 j bandpass. This is simply 
the last point on the integrated AM/AL curve at r = 22fc _1 Mpc. 
The mean AM within r^QQ SS centered on BCGs is shown on the top 
axis, although this is only rough since the transformation between 
-/V200 a- n d mass is non-linear. The mean AM / AL= 362 ± 54h, 
averaged over all samples, is plotted as the horizontal line 
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rather than just the last data point to infer the asymp- 
totic mass-to-light ratio. We will discuss the interpreta- 
tion of these errors more in the discussion. 

The average model asymptotic mass-to-light ratios are 
357 ± 9h for the richness binning and 335 ± 9h for the 
luminosity binning. Note the two binnings are not inde- 
pendent measures as the same clusters are used for both. 
Both are consistent with the values we get just taking 
the last point (AM/AL) 2 2M P c- 

9. ERRORS 

As stated earlier, all errors come from jackknife re- 
sampling of the data. T he method was discussed in 
ISheldon et~aTI (2004, 2007) in context of the lensing mea- 
surements. The same technique was used for the light 
measurements. The main difference between jackknifing 
in the lensing and luminosity measurements is that for 
the luminosity correlations we must jackknife all pieces 
of the estimator in Equation [4] 

The systematics in the M/L measurements are dom- 
inated by calibration uncertainties in the lensing mea- 
surements, in particular the photometric redshift esti- 
mates for the background source galaxies. These er- 
rors were discussed £ 12.11 and more detail can be found in 
iJohnston et al. (2007b). Although it is difficult to know 
the absolute scale of the expected uncertainties, based 
upon the results of both sim ulations and real world tests 
shown in lLima et a l. (2008) we estimate the overall level 
of systematics to be of order 10% for all contributing 
factors. 

We want to stress again that the errors on the asymp- 
totic mass-to-light ratio differ greatly for the two meth- 
ods for a very simple reason: the (AM/AL) 2 2M P c is 
derived from the last point on the M/L curve and the 
(AM / AL) asym is derived from fitting a simple descrip- 
tive model to all points. Because this model is not a 
physical model the points on small scales should not be 
expected to be constraining of the large scale M/L. So 
the errors on (AM/ AL) asym must be thought of as lower 
limits at this stage. The errors on (AM/AL)22M P c can 
be thought as upper limits since there is certainly some 
information in points at smaller radius. 

10. DISCUSSION 

The integrated AM / AL around MaxBCG clusters 
has a generic form. At 25/i -1 kpc, where the light of 
the cluster is dominated by the BCG, the AM/ AL is 
~ 101i~ 2 Mq/ 'Lq. There is a sharp rise and then the 
profile flattens out at large scales (22/i _1 Mpc). We fit 
a simple model to extract Txii, the radius at which the 
AM/AL reaches half its value at infinity (see §8.61 for 
details). 

For Af2oo< 10 Mq, r 1 / 2 is determined primarily by 
the relative amount of light in the BCG compared to 
the rest of the cluster. For higher masses the AM/AL 
asymptotes to (AM/ AL) asym more quickly relative to 
the lower masses, resulting in a smaller rin. 

The decreasing rj/2 with richness is partly due to the 
less dominant BCGs for higher richness clusters; the ef- 
fect of the BCG on the integrated light is only important 
on small scales for very rich systems. But it is also partly 
because the measured AM/AL within the large clusters 
is closer to the universal value. Figure [5] indicates that 
the AM/AL not including the BCG light is more flat 



for the high richness clusters that have especially sharp 
transitions. The AM/AL would be essentially equal to 
(AM / AL) asym if it were not for the presence of the BCG. 
This is not true for the lower richness systems. 

This difference between high and low richness sys- 
tems leads us back to the discussion of uncounted light. 
Uncounted light is any light not counted in our mea- 
surements. This uncounted light is partly intra-clustcr 
light (ICL), light not associated with galaxies. There is 
also light from galaxies less luminous than the threshold 
10 9 ' 5 /i -2 Lq. The total luminosity of galaxies below this 
threshold is probably not dominant, and the radial profile 
is probably similar to that of galaxies above the thresh- 
old, so including it would not change the profile dramat- 
ically. But the ICL has a steeper profile. It appears to 
follow a cxp(— r 1 / 4 ) law, with scale length of order 100 
h^ 1 k pc, and contains man y times more light than the 
BCG ^Gonzalez et alJl2005[ ). The total light in this com- 
ponent scales slowly with richness, and is more dominant 
on small scales in smaller systems. This could explain 
what appears to be a slower rise in AM / AL relative to 
larger systems. We will explore the ICL for MaxBCG 
clusters in a future paper. 

The (AM/AL)2oo, the excess mass-to-light within 
r 200 SS > scales with richness and M 2 qo- For lower rich- 
ness systems, the (AM/AL) 2 oo is considerably smaller 
than (AM/ AL) asym , while for larger systems it is of or- 
der (AM / AL) asym . This trend is probably a reflection 
of both a true difference in mean AM/AL and the effect 
of the ICL, which may be more dominant for lower mass 
systems. The (AM/AL)2oo versus A200 is not a good 
fit to a power law, but (AM / AL) 2 oo versus AM 2 oo is 
well fit by a power law with index 0.33 ± 0.02 (there is a 
non- linear relationship between mass and galaxy counts). 
However, no attempt was made to model possible offsets 
between BCGs locations and the halo mass peaks. The 
AM/AL will not be strongly affected because the lumi- 
nosity roughly traces mass, the AM 2 no as a function o f 
7V 2 oo does change significantly (j Johnston et al.ll2007bh . 
A fully model-depende nt analysis for both the mass and 
light profiles following Joh nston et al.l (|2007bf ) may im- 
ply different results for (AAf/AL)2oo measured around 
dark matter halos than those around MaxBCG clusters. 

It is difficult to compare with the literature due to 
the many conventions and methods in use with regards 
to cluster selection, lower luminosity thresholds, galaxy 
aperture definitions, mass apertures and estimators, pro- 
jected vs. de-projected masses, luminosities with or 
without background subtraction, and the various band- 
passes used for the exposures. With these caveats, wc 
will say that there is broa d agreement in the literature 
that M/L oc M°- 2 -° - 3 (e.g. [Gi rardi et al.l l2000l iLin et all 
120041: iPopesso et al.ll2007r i. Below we compare the f2 m 
calculated from the inferred global mass-to-light ratio, 
which may be less dependent on these factors. 

We used two methods to extract the asymptotic 
AM/AL: the value with in 22/i~ 1 Mpc, (AM/AL) 22 m P c, 
and the best-fit value from our fitting function, 
(AM / AL) asym . Note, on these large scales, any off- 
sets between the BCGs positions and the halo mass peak 
is irrelevant. We see no trend of either measure of the 
asymptotic AM/AL with N 2 qq- 

As we discussed in the introduction, for any given clus- 
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ter sample, the asymptotic AM/AL is proportional to 
the mass-to-light ratio of the universe. Repeating equa- 
tion [2] here for clarity: 

(AM/AL) asym = (M/L) b- 2 ljL 

, _9 br.m. 1 



J M/L ' 



where b 2 M ^ L depends primarily on the bias of the galaxy 
tracers relative to the underlying mass distribution, since 
the bias of the clusters likely cancels out in b cm /b c g. 

Thus, if the bias b 2 M ^ L is independent of cluster rich- 
ness, then the lack of a trend of (AM/AL) 2 2Mpc with 
-ZV200 means we have measured the same asymptotic 
AM/AL for all richness bins. If b 2 M , L is not indepen- 
dent of -/V 2 ooj then by chance the variations in asymp- 
totic AM/AL were canceled by a corresponding change 
in the bias. So we need to determine whether this bias 
is constant. 

The bias b 2 M ^ L should primarily depend on the mass 
of the halos hosting the light tracers, and this should 
be related to the luminosity of those galaxies. Included 
in the Tables Q] and [2] is the mean i-band luminosity of 
galaxies at IO/1 -1 Mpc separations (£?q ) ■ The value on 
larger scales was not well constrained for all richness bins. 
There is not a strong variation of this mean luminosity 
between the richness bins. The average, over all cluster 
richness bins, of the luminosity of galaxies within lOMpc 
is L a .2 H = (1.10 ± 0.04) x lO lo /r 2 L (-20.43 ± 0.04 
mag). Note in $5]we saw that L%. 2&i = 1.7 x l0 w h~ 2 L Q . 
Given the small variation in luminosity, and the fact 
that the bias yaries quite slowly for L < L* galaxies 
(|Tegmark et al.l [20041 ) . the bias fr^/ L should be roughly 
constant for each cluster sample. Thus we will assume we 
are measuring the true asymptotic value at large separa- 
tions, and average these values from all cluster richness 
bins. 

In section 18.81 we calculated this asymptotic value 
in two ways: first by taking the value of the inte- 
grated AM/AL for the last radial bin 22/i~ 1 Mpc to get 
(AM/AL) 22Mpc , and the second fitting a simple descrip- 
tive model to get (AM/ AL) asym . Averaging over all 
cluster luminosity bins gives 



(M/L) b-* L = 



_ / 362 ± 54/i within 22/i~ 1 Mpc 



357 ± 9h asymptotic fit 



(20) 



in solar units, where the bias b 2 M ^ L corresponds to that 

of La-i5 i — 1.10 x 10 10 /i~ 2 L Q galaxies at z=0.25. 

In Sj5]we calculated that the luminosity density in the 
°- 25 i-band is (3.14±0.10) x 10 8 h LqMpc" 3 . Multiplying 
the asymptotic M/L above by this, and dividing by the 
critical density, number we get an estimate of Q m that 
is independent of h: 



O h~ 2 
1LmU M/L 



0.20 ± 0.03 within 22/j- 1 Mpc 
0.194 ± 0.008 asymptotic fit 



(21) 



There is certainly more information than we have used 
in the 22/i _1 Mpc values since points at smaller radius 
do contain independent information. So in principle a 
more precise measurement could be made, but the error 
within 22/i~ 1 Mpc can be considered conservative. On 
the other hand, the error bar on the asymptotic fit is 



certainly an underestimate, as the fit is not based on a 
physical model. The errors are small simply because all 
the points in the curve are used rather than just the last, 
but in fact the points on small scales are not necessarily 
informative in interpreting the points on large scales in 
absence of a physical model; this error should be thought 
of as a lower limit. 

In addition there is a level of systematic error not ac- 
counted for here. Although the level of systematic error 
is not precisely known, we expect it to be < 10%, mostly 
due to error s in photometric redshi fts of lensing source 
galaxies. See lJohnston et all (|2007bl ) for a more complete 
discussion of systematic errors. 

There have been numerous studies measuring mass-to- 
light ratios of clusters. As discussed above with regards 
to the (AM/AL) 2 oo--^2oo relation, there are a wide vari- 
ety of techniques and conventions in place. Many of these 
studies use the mass-to-light ratio to infer Q m by assum- 
ing the value they get is equal to that of the universe. 
Although this is not always a well-justified assumption, 
converting to Q m does remove most of the dependence 
on the bandpass, galaxy apertures, and mass aperture 
(as long as the mass aperture isn't too small). Using 
the inferred il m may lead to a more robust comparison 
between the various results. 

The series of papers by Bahcall et al. have consis- 
tently estimated tt m ~ 0.2 from this technique, using 
vari ous mass estimators. For exa mple, using X-ray clus- 
ters iBahcall fc Comerfordl (f2002h found 0.17 ± 0.05 and 
SDSS clusters whose masses wer e calibrated from ve loc- 
ity dispersions gave 0.19 ± 0.08 (jBahcall et al.ll2003h . 

Cluster velocity dispersions in the CNOC data have 
also been used to calculat e O m . Using stacked v elocity 
and £?-band light profiles, ICarlberg et alJ (|1997h found 
f2 m = 0.19 ± 0.06. A recent analysis usi ng individual 
masse s and if-band light found 0.22 ± 0.02 lMuzzin et al.l 
(l2007h . 

Using the "caustic" method for esti mating masses and 
if -band light in the CAIRNS survey. iRines et all (|2004D 
inferred Vl m — 0.18 ± 0.03. This method claims to yield 
more accurate masses by correctly modeling the infall 
regions around clus ters. The study of CNOC2 groups by 
iParker et all (|2005l ) using weak lensing for group masses 
and B-band light found Q, m = 0.22 ± 0.06. It should 
be noted that these are rather small groups; as we have 
shown in this work the mass-to-light ratio is less than 
universal within their virial radii. Using X-ray data for a 
set of 2MASS cl usters, and 2MASS if-band luminosities, 
iLin et al] (p003h find Sl m = 0.19 ± 0.03. 

Although there is general agreement when using mass- 
to-light ratios to infer r2 m , these measurements give 
smaller values for £l m than baryon fraction measure- 
ments, although baryon fraction measurements do have 
dependen c e on h. This is noted, for example, by 
ILin et al.l (|2003h where they find a higher Q, m = 0.28 ± 
0.03 using the baryon fraction using their own data, and 
suggest their mass-to-light ratios are too low. 

In our analysis we demonstrated that the mass-to-light 
ratio reaches an asymptotic value at large radius, which 
removes one possible error in determining the global 
M/L. The missed light, ICL and otherwise, will gener- 
ally push the mass-to-light ratios even lower. It is possi- 
ble that the b 2 M ^ L is considerably greater than unity, but 
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this is not the theoretical expectation. The light tracers 
are L < L* galaxies which should reside in under-bia sed 
halos (|Sheth fc TormenlH99l [Seliak fc Warrenll2004h . 

Note we have assumed a fiducial flat cosmology with 
= 0.27 when performing these measurements, and 
this has not been varied properly in order to constrain 
O m above. Thus the calculated Sl m could be slightly 
biased, and there could be some additional statistical er- 
ror not accounted for here. Generally, decreasing the 
assumed VL m increases the inferred £l m using this tech- 
nique, through it's affect on the critical density for lens- 
ing. But at these redshifts such effects are secondary, 
well below our errors. In a future paper we will present a 
full cosmological analysis including these effects as well 
as a proper model for BCG displacements f r om the halo 
mass peak as carried out in I Johnston et all (|2007bl ). 

These results are a precise test for models of structure 
and galaxy formation. The M/L results, coupled with 
the galaxy color and luminosity distributions as a func- 
tion of radius from clusters (sections 18 . 1 1 and l8 . 3|) show in 
detail how different types of galaxies are distributed in 
and around these clusters and how they are clustered 
relative to the underlying mass distributio n (see also 
lHansen. Sheldon. Wechsler. fc Koesterll2007t ). These are 
the most basic cross-correlation statistics that can be ad- 
dressed, and are perhaps the most precise and powerful 
statistics that can be measured by a photometric survey. 

It is significant that these measurements were car- 
ried out in a purely photometric data set. Every step 
of the process uses photometric data only, from cluster 
finding, to galaxy cross-correlations, to lensing measure- 
ments. These type of measurements can be carried out 
in any high-quality survey with properly chosen band- 
passes. Future surveys such as DES, SNAP, and LSST 
will greatly extend these measurements and further chal- 
lenge our theories of cosmology and galaxy formation. 
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